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ABSTRACT 

The multidrug efflux pump MepA is a major con- 
tributor to multidrug resistance in Staphylococcus 
aureus. MepR, a member of the multiple antibiotic 
resistance regulator (MarR) family, represses mepA 
and its own gene. Here, we report the structure of a 
MepR-mepft operator complex. Structural compari- 
son of DNA-bound MepR with 'induced' apoMepR 
reveals the large conformational changes needed 
to allow the DNA-binding winged helix-turn-helix 
motifs to interact with the consecutive major and 
minor grooves of the GTTAG signature sequence. 
Intriguingly, MepR makes no hydrogen bonds to 
major groove nucleobases. Rather, recognition- 
helix residues Thr60, Gly61, Pro62 and Thr63 make 
sequence-specifying van der Waals contacts with 
the TTAG bases. Removing these contacts dramat- 
ically affects MepR-DNA binding activity. The wings 
insert into the flanking minor grooves, whereby 
residue Arg87, buttressed by Asp85, interacts with 
the 02 of T 4 and 04' ribosyl oxygens of A 23 and 
T 4 . Mutating Asp85 and Arg87, both conserved 
throughout the MarR family, markedly affects 
MepR repressor activity. The His14:Arg59 and 
Arg10':His35:Phe108 interaction networks stabilize 
the DNA-binding conformation of MepR thereby 
contributing significantly to its high affinity 
binding. A structure-guided model of the MepR- 
mepA operator complex suggests that MepR 
dimers do not interact directly and cooperative 
binding is likely achieved by DNA-mediated 
allosteric effects. 



INTRODUCTION 

The multiple antibiotic resistance regulator (MarR) family 
is a large protein family of bacterial transcription factors 
(1—4). It was named after Escherichia coli MarR, a tran- 
scription repressor of the marRAB operon, which 
produces MarA, the product of which triggers a multiple 
antibiotic resistance (mar) phenotype that is characterized 
by insusceptibility of the bacterium to a diverse repertoire 
of antibiotics, such as tetracycline, chloramphenicol, (3- 
lactams, puromycin, nalidixic acid and flouoroquinolones 
(3,5-8). Other members of the MarR family have been 
shown to regulate the bacterial response to oxidative 
stress (9-12), expression of virulence determinants (2,13- 
17), multidrug resistance (3,4,18-27), transport of transi- 
tion metals across the membrane (28) and catabolism of 
aromatic xenobiotics (29-31). Most MarR transcription 
regulators are repressors, whereby repression is achieved 
by binding of one or more protein dimers to the operator 
of the regulated gene. The binding site of the repressor 
often overlaps with crucial promoter elements (the -35 
box, the -10 hexamer or the transcription start site 
(TSS)) thus inhibiting productive RNA polymerase 
binding. Some MarR-type repressors might affect tran- 
scription by competing for DNA binding with a transcrip- 
tion activator and conversely some representatives of the 
MarR family may act as activators by competing for 
binding with a repressor (2). 

To date, only six structures of MarR family members 
bound to their cognate DNA are available. These include 
Bacillus subtilis OhrR (32), Salmonella enterica SlyA (33), 
Mycobacterium tuberculosis MosR (11), Yersinia pseudo- 
tuberculosis RovA (13), Streptomyces coelicolor SCO3205 
(34) and Sulfolobus tokodaii ST 17 10 (22). Five assume 
a similar binding mode, in which the recognition helices 
of the winged helix-turn-helix (wHTH) domains from 
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both subimits slide into consecutive major grooves, while 
the wings insert into the flanking minor grooves. Binding 
specificity is achieved, mostly, by direct reading of the 
major groove bases by amino acid residues of the recog- 
nition helices and, to a lesser extent, via reading of the 
minor groove bases by the wings. Interestingly, the struc- 
ture of the ST1710-DNA complex reveals a very different 
organization, in which only the wing motifs interact with 
DNA specifically via bases in the minor grooves, whereas 
the recognition helices neither insert into the major 
grooves nor establish specific, direct interactions with 
DNA bases (22). 

MepR is a member of MarR family of transcriptional 
regulators. MepR represses the transcription of mepA, 
which encodes a multidrug resistance efflux pump belong- 
ing to the family of multiple antibiotics and toxin 
extrusion (MATE) pumps, as well as the transcription of 
its own gene (19,21,23). MepR abolishes transcription by 
binding to operator regions, which harbour inverted 
repeats containing a GTTAG signature sequence. The 
mepR operator contains one inverted repeat, extending 
over the -10 hexamer and TSS, whereas the mepA 
operator contains two such MepR binding sites, which 
encompass the -35 and -10 elements and the TSS (19). 
The presence of two abutting MepR binding sites in the 
mepA operator region implies that MepR might bind to 
the mepA operator cooperatively. Repression is relieved 
upon the binding of MepA substrates to MepR followed 
by dissociation of the MepR DNA complex. A typical 
MepA/MepR substrate is a hydrophobic, very often 
aromatic, positively charged molecule. Interestingly, the 
repertoire of substrates of MepR and MepA overlap, 
but are not identical (18-21). 

In order to gain a fuller understanding of the repression 
and DNA recognition mechanisms of MepR, we 
determined the structures of a MepR-mepR operator 
DNA complex and characterized the amino acid residues 
directly and indirectly crucial for specificity and affinity of 
DNA binding using isothermal titration calorimetry (ITC) 
and electrophoretic mobility shift assays (EMSA) and 
in vivo by P-galactosidase assays. The MepR-DNA struc- 
tures reveal a heretofore-unseen mechanism of binding site 
recognition by a MarR protein. Further, the MepR-mepR 
operator complex structure allowed the building of a 
model of the MepR-mepA operator complex in order to 
understand better the underlying mechanism of its posited 
cooperative binding. 

MATERIALS AND METHODS 

Cloning, mutagenesis, protein expression and purification 

MepR with an N-terminal cleavable hexa-histidine tag was 
used for crystallization (see Birukou et al. (35) for details 
of its preparation). In most ITC experiments, EMSA and 
P-galactosidase assays recombinant MepR containing a 
non-cleavable C-terminal hexa-histidine tag was used. 
The plasmid used for its expression has been described pre- 
viously (19). Protein expression was achieved using strain 
C41 DE3 (Lucigen) or BL21 DE3 One Shot (Invitrogen). 
The cell growth protocol was described by Kumaraswami 



et al. (21). Protein purification was conducted according 
to the protocol described previously in Birukou et al. 
(35). All mutagenesis was carried out using standard 
protocols and QuikChange™ Site-Directed Mutagenesis 
(Agilent Technologies). Selenomethionine-substituted 
(SeMet) MepR was expressed using the methionine inhibi- 
tory pathway protocol (36) and purified as described for 
native MepR. The protein concentration was determined 
using UV spectroscopy and a calculated extinction 
coefficient s ?8 ° = 8.94 mM _1 cm (determined using 
ProtParam (37)). 

Crystallization, structure determination and refinement of 
the MepR-DNA complex 

Crystallization of the complex was carried out using the 
hanging drop-vapour diffusion approach. The DNA 
strands were annealed at 95°C for 5min and cooled 
slowly to room temperature on the bench. The protein- 
DNA complex was prepared in advance by mixing 
concentrated MepR with 1.2-fold molar excess dsDNA 
to obtain 0.375-0.5 mM MepR-DNA complex solutions. 
Oligodeoxynucleotides of different length and sequence 
were tried in crystallization experiments. The DNA, that 
yielded diffraction-quality crystals of the MepR-DNA 
complex was the palindromic blunt-ended 24 base pair 
(bp) oligodeoxynucleotide, 5'-ATT TAG TTA GAT 
ATC TAA CTA AAT-3'. 

One MepR-mepR operator complex crystallized in 
space group PI and was obtained by mixing the MepR- 
DNA complex with mother liquor containing 0.14M 
CaCl 2 , 0.7 M Na acetate, pH 4.6, 14% 2-propanol and 
30% glycerol in a 1:1 (v:v) ratio. The crystal diffracted 
to 2.84 A resolution. X-ray diffraction intensities were col- 
lected at ALS beamline B.L. 5.0.2, wavelength = 1.000 A. 
Data were processed using HKL3000 (38). Relevant par- 
ameters of data collection, integration and scaling are 
provided in Table 1. The structure of the WlzpR-mepR 
operator complex was solved via molecular replacement 
(Phaser-MR in Phenix, (39)) using the structure of the 
B. subtilis OhrR-DNA complex (PDB accession code 
lz9c, (32)) as a search model. Three MepR-DNA 
complexes were located in the asymmetric unit. After cor- 
recting the protein and DNA sequences and initial manual 
model building in Coot (40), the structure was refined in 
Phenix applying multiple rounds of positional, B-factor 
and Translation-Libration-Screw (TLS) refinement, 
interspersed with manual model rebuilding in Coot. 
Non-crystallographic symmetry (NCS) restraints were 
not imposed during refinement. The final model was 
refined to an i? WO rk = 20.6% and ^ free = 24.1%. 

The second crystal form took space group P3i and was 
obtained from solutions of 0.04 M Mg acetate, 0.05 M Na 
cacodylate, pH 6.0 and 30% (±)-2-methyl-2,4- 
pentanediol. The crystal diffracted to 3.0 A resolution. 
X-ray diffraction intensities were also collected at ALS 
beamline B.L. 5.0.2 (wavelength = 1.000 A). Data were 
integrated and scaled using HKL3000. Relevant param- 
eters of data collection and processing are provided in 
Table 1. The initial model of the P3i MepR-DNA struc- 
ture was obtained by molecular replacement using the 
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Table 1. Selected X-ray intensity data collection, structure solution and refinement statistics 
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structure of the MepR-DNA complex obtained from the 
PI crystal. Three MepR-DNA complexes were found in 
the asymmetric unit. However, after initial refinement the 
difference (mF D -DF c ) electron density map contained 
numerous positive peaks suggesting that the model was 
incomplete. Thus, in order to obain de novo phase infor- 
mation we crystallized a MepR-DNA complex using 
SeMet-labelled MepR in the P3i space group using the 
conditions for the native protein-DNA complex. SeMet- 
MepR-DNA crystals diffracted only to 3.66 A resolution. 
The X-ray diffraction data were collected at ALS beamline 
B.L. 5.0.2, using a wavelength of 0.979 A, corresponding 
to the Se absorbance peak. The structure was solved using 
SAD phasing in combination with molecular replacement 
(SAD-MR in AutoSol, Phenix). The previously solved 
MepR-DNA complex was used as a search model. 
In total, 40 Se atoms were located in the asymmetric 
unit, which corresponds to 4 MepR dimers. Density 
Modification in Resolve (Phenix) resulted in a figure of 
merit of 0.58 and produced interpretable electron density 
maps that allowed us to correct the initial molecular 
replacement model and build a fourth MepR dimer. The 
complete model was refined further in Phenix using the 
native, higher resolution (3.0 A) data. NCS restraints 
were not imposed on the equivalent protein and DNA 
chains during refinement of either structure. The final 
model has an R wolk = 19.4% and R free = 23.4%. 



Selected crystallographic statistics are provided in 
Table 1. The analysis of DNA conformation was done 
using Curves+ (41). All structural alignments were 
performed in Coot (40) and PyMol (The PyMOL 
Molecular Graphics System, Version 1.2r3pre, 
Schrodinger, LLC) and figures of the crystal structures 
were prepared in PyMol. 

Isothermal titration calorimetry 

All binding reactions were carried out at 25°C using a VP- 
ITC calorimeter (Microcal). Data were collected and 
analysed using the manufacturer supplied software 
package, Origin 7.0 (OriginLab Corp.). Each ITC run 
included an initial 4ul injection followed by thirty 10 ul 
injections of the MepR solution into the sample cell con- 
taining DNA (total volume = 1.45 ml). In some experi- 
ments DNA was injected into the cell containing the 
MepR solution. The experimental details of each experi- 
ment are provided in the relevant figure legends. DNA was 
purchased from IDT and annealed by heating at 95° C for 
5min followed by slow cooling to room temperature on 
the bench. The buffer used in all ITC experiments was 
20 mM Tris HC1 pH 7.5 (room temperature), 150mM 
NaCl and 5mM MgCl 2 . The fitted thermodynamic 
parameters as well as oligodeoxynucleotide sequences 
used in these experiments are provided in Supplementary 
Table SI. 
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Electrophoretic Mobility Shift Assay 

EMSAs employing wild type (WT) MepR or the 
MepR(R79A) mutant and the WT mepR operator site 
(167 bp) were performed at least twice essentially as 
described, modified only by elimination of salmon sperm 
DNA as a non-specific competitor (19). 

P-Galactosidase assay 

Derivatives of mepR were cloned into plasmid pALC2073, 
placing them under control of the tetracycline-inducible 
xyl/tetO promoter, and then were transduced into the 
proper host strains (42,43). These included SA-K2908 or 
SA-K2916, which contain mepA::lacZ and mepR::lacZ 
transcriptional fusions, respectively (19,20). The in vivo ef- 
ficiency of MepR derivatives in repression of chromosomal 
mepR or mepA expression was quantitated using a fluores- 
cence-based (3-galactosidase assay. This procedure has been 
described previously, and utilizes triplicate cultures of test 
organisms growing from exponential to stationary phase 
over a 10-h period. P-Galactosidase activity, which correl- 
ates with expression of the gene to which lacZ is transcrip- 
tionally fused, was quantitated using duplicate aliquots 
removed at specified time intervals (every 30min from 1 
to 5h, then at 6, 8 and lOh) (19). All strains were grown 
in the absence and presence of 0.05 u-g/ml of tetracycline to 
quantitate the effect of induction of plasmid-based mepR 
expression. A control strain expressing WT MepR also was 
induced in the same manner and examined concurrently. (3- 
Galactosidase activity was determined using an FLx800 
fluorescence microplate reader (BioTek Instruments Inc., 
Winooski, VT). Cumulative mepR or mepA expression was 
determined by integrating the area beneath expression 
curves with SigmaPlot 12.0 (Systat Software Inc., 
Chicago, IL). All P-galactosidase assays were replicated a 



minimum of three times and data were normalized to the 
activity of WT MepR at the mepA or mepR operators in 
SA-K2908 or SA-K2916, respectively, which were 
considered to be 100%. 



RESULTS AND DISCUSSION 

The structure of MepR in complex with the mepR 
operator DNA 

As a prelude to understanding the importance of the mepR 
operator sequence in the overall DNA-binding affinity 
of MepR and our effort to crystallize a germane MepR- 
mepR operator complex, we carried out several MepR- 
DNA binding assays using ITC. WT MepR binds to a 
27 bp mepR operator site containing two 2-fold related 
GTTAG signature sequences that are separated by a 
4 bp linker with a K d of ~36nM (Figure 1), which is 
similar to that previously reported, 24 nM, determined 
using a fluorescence polarization-based DNA-binding 
assay (21). Unfavourable positive enthalpy (AH = 
8.73 kcal/mol) of the reaction along with large favourable 
entropy (AS = 63.3 cal/mol*deg, and corresponding 
TAS = 18.86 kcal/mol) clearly indicates that the binding 
is entropy driven (Supplementary Table SI). In protein- 
DNA complexes, two physical phenomena are responsible 
for the observed thermodynamic parameters. One is the 
polyelectrolyte effect, which originates from the displace- 
ment of DNA phosphate backbone-associated cations by 
positively charged amino acid residues of a partner protein 
and causes a large entropy increase. The second is the 
hydrophobic effect, which results from the burial of 
non-polar surfaces upon protein-DNA complex forma- 
tion and concomitant release of structured water 
molecules from the hydration shell of a protein and 
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Figure 1. MepR binding to the mepR operator. ITC thermograms and binding isotherms for four MepR-DNA complexes. In the first and fourth 
binding experiment, protein (400 and 660 uM, respectively) was injected into DNA (24 and 15 uM, respectively); a stoichiometry of 2 corresponds to 
1 dimer of MepR binding to dsDNA. In the second and third experiments, DNA (250 and 400 uM, respectively) was injected into protein (49 and 
75 uM, respectively); a stoichiometry of 0.5 corresponds to a single dsDNA molecule binding to a dimer of MepR. The sequences of the oligodeox- 
ynucleotides and fitted thermodynamic parameters are provided in Supplementary Table SI. The data for the 14 bp sequence and 27 bp DNA site 
with a scrambled recognition helix-binding region demonstrated no binding and were not fit. The molar concentrations of MepR in all experiments 
were presented per monomer. 
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major and minor grooves of a DNA, resulting in increases 
in the entropy and enthalpy (44). The thermodynamic par- 
ameters observed in our ITC experiments on MepR-mepR 
operator binding indicate that the prevalent binding 
interactions are non-specific electrostatic and non-polar. 

Reducing the length of oligodeoxynucleotide to 14 bp to 
include only the GTTAG signature sequence essentially 
abolished MepR binding (Figure 1). The addition of a 
TA dinucleotide to the 5' end and a GA dinucleotide to 
the 3' end of the 14 bp improved the affinity ~30-fold 
(ATd~2(iM, Figure 1), but was still 55-fold lower than 
the WT value. As demonstrated previously increasing 
the length further to 22 bp restored high affinity binding 
(21). To assess the importance of GTTAG signature 
sequence, we constructed a 27 bp mepR operator site, in 
which the central 14 bp core, which encompasses both sig- 
natures sequences, was scrambled. The resulting thermo- 
gram demonstrates no specific DNA binding (Figure 1). 

Using the binding data from our ITC and fluorescence 
polarization experiments and in order to crystallize a 
physiologically relevant MepR-mepR operator complex, 
we tested 10 different oligodeoxynucleotides of varying 
length and sequence. Multiple crystals were obtained, 
most of which diffracted to only 5 A resolution. 
However, two conditions produced crystals, which dif- 
fracted to ~3.0A resolution. One took the triclinic space 
group PI and a second took the trigonal space group P3]. 
The DNA in both these crystals was a blunt-ended 24 bp 
oligodeoxynucleotide. The sequence of the crystallized 
oligodeoxynucleotide was modified slightly from the 
native mepR operator sequence to 5'- TAT TTA GTT 
AGA CAT CTA ACG AAA TGG-3' (19) by removing a 
thymine from the 5' end and two guanines from the 3' end 
(removed bases are italicized), and by replacing cytosine 
12 and guanine 20 (both are shown in bold font) with 
thymines. The result is the palindromic 24 bp oligodeox- 
ynucleotide, 5'-A 1 T 2 T3T 4 A5G 6 T 7 T 8 A 9 G 1 oA 11 T 12 A 13 T 14 
Ci5T 16 A 17 A 18 Ci9T2oA2iA22A23T24-3', which allowed us 
to avoid potential statistical disorder of the DNA bases 
in the crystal structure. 

The unit cell of the triclinic crystal form contains 
three copies of the MepR-DNA complex. The final 
model demonstrates excellent stereochemistry with 



97.9% of the residues in the most favourable region and 
no residues in the disallowed region of the Ramachandran 
plot. The asymmetric unit of the trigonal crystal form 
contains three copies of the MepR-DNA complex as 
well as one additional MepR dimer bound to DNA non- 
specifically. This model also shows excellent stereochem- 
istry with 97.4% of all residues in the most favoured 
region and the remaining 2.6% in the additionally 
allowed region of the Ramachandran plot. 

Each MepR subunit is composed of six a helices and 
two P strands (Figure 2A). The N terminus of al (residues 
1-14), the C terminus of a5 (residues 111-119) and a6 
(residues 120-139) of both subunits of the MepR dimer 
form the dimerization domain. The wHTH DNA-binding 
domain is composed of the helices a2 (residues 31^14), a3 
(residues 48-58) and <x4 (the recognition helix, residues 
60-76), and the wing motif is formed by (3 strands pi 
(residues 77-81) and p2 (residues 89-93) and their con- 
necting loop (residues 82-88). The dimerization domain 
is connected to the DNA-binding domain via the linker 
region, which includes the C-terminal half of al (residues 
15-30) and the N-terminal portion of oc5 (residues 
95-110). 2F D -F C omit electron density maps for both the 
triclinic and trigonal crystal forms that encompass the 
N-terminus of the recognition helix and its major 
groove-binding site, which contains the GTTAG signa- 
ture motif, are shown in Supplementary Materials, 
Supplementary Figure SI. 

Structure of mepR DNA 

For the greater part, MepR binding to the mepR operator 
does not cause any dramatic changes in the DNA-binding 
site conformation when compared to ideal B-DNA 
(Figure 2A). We do not observe unusual base pairing or 
base expulsion. Conformational analysis of the DNA 
using the Curves+ web server (41) confirms that the 
DNA is predominantly in B-form. In total, six complexes 
from both MepR-DNA structures were analysed. 
Pairwise alignment of the DNA duplexes produced RMS 
deviations of 0.91-1.49 A for trigonal crystal form, and 
0.81-1.36 A for triclinic crystal form. The largest devi- 
ations, as expected, were localized at the termini of the 




Figure 2. Crystal structure of MepR bound to the mepR operator. (A) Cartoon representation of MepR bound to a 24 bp oligodeoxynucleotide 
containing the mepR operator site. One subunit of the dimer is coloured blue and the other green. The secondary structure elements of each protomer 
are labelled with one set of labels primed. (B) The DNA-binding domain of MepR interacting with the major and minor grooves of DNA. Each 
element of the MepR wHTH motif is labelled. 
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duplexes where the base pairing is the weakest due to the 
loss of 7i-stacking stabilization on one of the sides. The 
duplex DNAs are only slightly bent, typically 5-10°, with 
only one outlier showing ~23° smooth bend. o The average 
base pair twist is 33.8°, and the rise is 3.26 A per nucleo- 
tide. The minor groove becomes wider and shallower in 
the centre of the DNA sequence: the width and depth of 
the central part encompassing bases 9-16 are 8-9 A and 3- 
4 A, respectively. The corresponding parameters for ideal 
B-DNA are 4.2 and 4.9 A (41). The width of the major 
groove in contact with the recognition helices becomes 
slightly smaller (9-10 A versus 10.5 A in ideal B-DNA) 
but increases in the central part of the 24mer and at the 
ends (up to 13 A). Further, the major groove is deeper in 
the central part of the sequence, 7-8 A as compare to 5 o .4 A 
in B-DNA, but the depth decreases at the ends (5-7 A). 

Protein-DNA interactions in MepR-DNA complex 

Similar to other MarR family members, the in vivo mepR 
operator site is a pseudopalindrome, which we modified 
slightly to be a true palindrome. Helices od and otl' of the 
dimerization domain lie atop the central part of the minor 
groove and make contacts with the sugar-phosphate back- 
bone. Helices oc2, oc3 and oc4 of the wHTH domains of both 
subunits interact with the major grooves encompassing the 
signature sequence GTTAG, while the wings insert into 
the flanking AT-rich minor grooves (Figure 2A and B). 
The majority of the protein-DNA contacts are non- 
specific and occur between the wHTH domains and 
sugar-phosphate backbone of the DNA (Figure 3). Most 



of these interactions are hydrogen bonds or salt bridges 
between the phosphate groups and charged or polar 
amino acid side chains, including residues of the recogni- 
tion helix and its N-terminal turn (residues Arg59, Thr60, 
Thr63, Ser65, Arg69 and Asn70), wing residues (Arg79, 
Gln84, Thr86, Arg88 and Lys89), the N-terminal residues 
of a3 (Thr49, Gln50 and Asn51) and a2 (residues Thr30 
and Gln33), and residue Hisl4 of helix ocl. The positively 
charged dipoles of the N-termini of helices oc2 and oc3 in 
both subunits additionally stabilize the binding via elec- 
trostatic interactions with the negative phosphate 
backbone, which is common for most MarR-DNA struc- 
tures (11,13,30,32-34). There is also a multitude of van der 
Waals contacts from residues Glnl8, Glu32, Arg59, 
Thr60, Thr63, Asn70, Asp85 and Arg88 to the deoxyri- 
bose rings and phosphates. 

Interactions between the wings and minor grooves 

The MepR wings interact with the minor grooves via 
residues Arg87 and its dyadic mate Arg87' (Figure 4A). 
The guanidine group of Arg87 donates hydrogen bonds to 
the 02 atom of T 4 and 04' atoms of the deoxyriboses of 
A 2 3 and T 4 (Figures 3 and 4A). In some cases there is an 
additional weak hydrogen bond between Arg87 and the 
02 of T 3 . This residue also makes van der Waals contacts 
with the T 4 and A 2 2 bases (Figures 3 and 4A). At the same 
time the Arg87 side chain is buttressed by a salt bridge to 
the carboxylate group of residue Asp85, which in turn is 
hydrogen bonded to the guanidinium group of residue 
Arg88 (Figure 4A, inset). To determine the importance 



T49 



S65 




T86 



Palindrome 
sequence 



N70 



Q33 



H14 



Q32 



5'-jATTTAGTTAGArATCTAACTAAAT-3' 

3'-taaatcaatctaItagaugattta-5' 



3' 
5' 



mepR operator 5'- 7 477TAGTTAG^ C ATCTAAC G ^AA'JGG 
sequence 3'^r4/»ATCAATqG|rAGATTqGTTT/ 
-10 TSS 

Figure 3. Schematic representation of protein-DNA interactions in the MepR-mepR operator complex. Due to the symmetric nature of the complex 
only half of the 24 bp DNA is shown. Solid arrows indicate electrostatic interactions (hydrogen bonds or salt bridges) and dashed arrows indicate 
van der Waals contacts. The lower part of the figure compares the sequence of the palindromic oligodeoxynucleotide used in crystallization to the 
actual sequence of the mepR operator. The dashed box indicates the part of the DNA shown in the interaction diagram. The thin line boxes indicate 
the base pairs, which were changed or removed to obtain the palindromic sequence. The promoter elements (the -10 box and TSS) are shown in bold 
italics. The arrows above the sequence indicate the inverted repeats GTTAGNNNNCTAAC. 
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Figure 4. Interaction of the wing motif with the minor groove. 

(A) Residue Arg87 interaction with the 02 atoms of T 4 and T 3 , and 
the ribosyl oxygens of T 4 and A 2 3. The boxed inset shows the electro- 
static network between residues Arg87, Asp85 and Arg88. 

(B) Repression of the chromosomal mepR or mepA operators by 
MepR mutants D85S and R87A as determined by the ability of 
mutant MepR to affect the expression of the mepR::lacZ and 
mepA::lacZ transcriptional fusions. Data are presented as the mean 
repressor activity ± the standard deviation. The data were normalized 
to WT MepR expression and are provided in Supplementary Table S2. 

(C) Sequence alignment of the wing motif region of MarR proteins the 
structures of which have been solved in complex with cognate DNA. 
The conserved residues Asp85 and Arg87 are shown in green boxes. 
The alignment was created using ClustalW (45) and shaded using 
BoxShade 3.2. (D) MepR binding to the mepR operator with its 
wing-binding nucleotides modified. The sequences of oligodeoxynu- 
cleotides are indicated above the thermograms. In the left panel, 
DNA (400 uM) was injected into protein (55 uM); a stoichiometry of 
0.5 corresponds to a single dsDNA molecule binding to a dimer of 
MepR. In the right panel, protein (400 uM) was injected into DNA 
(20 uM); a stoichiometry of 2 corresponds to 1 dimer of MepR 
binding to one dsDNA. The fitted thermodynamic parameters are 
provided in Supplementary Table SI. 



of the Asp85-Arg87 pair for DNA binding, we created the 
MepR single mutants R87A and D85S and tested their 
ability to repress the expression of (3-galactosidase from 
chromosomal mepR::lacZ and mepA:.iacZ fusions 
(Figure 4B). Both mutations significantly impair the re- 
pressor function of MepR reducing its operator binding 
activity 5-10-fold in comparison to the WT protein, with 
the R87A change having a greater effect than the D85S 
substitution, especially at the mepA operator site which is 
not repressed at all by the MepR(R87A) protein. Similar 
defects in repressor activity were observed in MexR, SarA 
and MarR that contained substitutions of the correspond- 
ing arginine to alanine, histidine or cysteine (46^18). 

The structural configuration of the tip of the wing motif 
is conserved among MarR family members (Figure 4C). In 
most MarR family member-DNA complex structures 
residue Arg87 is found to favour a thymine at the ninth 
position from the dyad axis of the palindrome 
(11,13,30,32-34). We tested whether Arg87 indeed dis- 
criminates in favour of thymine in MepR-DNA complex 
using ITC. The replacement of thymines at the 3rd and 4th 
position by guanines dramatically increases the dissoci- 
ation constant of protein-DNA complex, from ~36 nM 
to ~2uM (Figure 4D). Such a marked change is 
associated with the unfavourable steric effect of the 
exocyclic 2-amino group of guanine, which likely causes 
the Arg87 side chain to relocate to a position outside the 
minor groove. In contrast, changing the thymines to 
adenines at the same positions is less detrimental as the 
binding affinity drops by only ~5-fold (Figure 4D). The 
absence of an exocyclic group at position 2 of adenine still 
allows the insertion of the Arg87 side chain into the minor 
groove. Additionally, the 02 atom of T 2 2, which is com- 
plementary to A 3 , may accept a hydrogen bond from the 
amino group of Arg87 side chain. Replacing T 3 and T 4 
with cytosines should not disrupt MepR binding since 
both bases possess an 02 atom and can accept a 
hydrogen bond from Arg87. Indeed, such a replacement 
is present in the first MepR binding site of the mepA 
operator sequence (Figure 8), which exhibits an affinity 
similar to the mepR operator site (21). 

Interactions between the recognition helices and major 
grooves 

The N-termini of the recognition helices of MepR insert 
deeply into consecutive major grooves and interact with a 
number of bases. Intriguingly, these protein-DNA 
contacts are exclusively van der Waals interactions. 
Specifically, the side chain of residue Pro62 makes van 
der Waals contact with the exocyclic methyl group 
(distance, a! =3.6 - 4.0 A) and the 04 atom (<i= 3.8 A) 
of T 7 , with the amino group of A 17 (d = 3.5 A), with the 
04 atom of T g (d = 3.2 A), and with the 04 atom and C7 
methyl group of T 16 (d =3.5 and 4.0 A, respectively) 
(Figures 3 and 5A). Additionally, the carbonyl group of 
residue Gly61 interacts with the methyl group of T 7 
(d = 3.7 A), the hydroxyl of the o Thr60 makes contact 
with Tig methyl group (d = 3.4 A) and Thr63 hydroxyl 
group interacts with the C5 atom of C i5 (d=3.6A) 
(Figures 3 and 5A). Thus, the major groove is devoid of 
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base-specifying protein-DNA hydrogen bonds. Such 
hydrogen bonds are typical for protein-DNA complexes 
involving transcription factors that contain helix-turn- 
helix motifs and are observed in the structures of most 
MarR-DNA complexes (Figure 5C). For example, in 
the S. enterica SlyA-DNA complex, residue Arg65 of 
the recognition helix directly reads the guanine of its AA 
TCG signature sequence (33). In B. subtilis OhrR bound 
to cognate DNA many more electrostatic contacts are 
found: Ser68 makes a direct hydrogen bond to N7 atom 
of an adenine in the signature sequence TAACAT; in 
addition, the carbonyl group of residue Gly69 hydrogen 
bonds to the exocyclic amino group of cytosine and 
residues Asp67 and Arg77 make water-mediated 
contacts with the bases of the major groove (32). 
Similarly, M. tuberculosis MosR interacts with its 
binding site via hydrogen bonds between residues Arg70 
and the first guanine, Thr71 and the cytosine complimen- 
tary to the second guanine, and Arg75 and the second 
guanine and the fourth thymine of the palindrome 
sequence GTGTA (11). More recently, S. coelicolor 
SCO3205 was shown to utilize hydrogen bonds between 
recognition-helix residues Arg72, Ser73 and His77 and the 
first, third and fourth bases of the ATTGA signature 
motif for specific binding (34). An interesting exception 
to these examples is observed in the crystal structure of 
the Sulfolobus tokodaii ST1710-DNA complex in which a 
dramatically different binding mode is observed whereby 
the recognition helix of one subunit lies above the sugar- 
phosphate backbone and does not insert into the major 
groove (22). 

The absence of hydrogen bonds and other electrostatic 
interactions between the residues of recognition helix and 



the bases of the major groove in the MepR-DNA complex 
is unusual and unprecedented for the MarR family. Often, 
electrostatic interactions can be water-mediated as was 
observed in the OhrR-DNA complex structure (32). 
We did not observe any electron density, which could 
be attributed to solvent molecules in the vicinity of the 
recognition helices. To exclude the possibility, that the 
relatively low resolution of our structures prevented us 
from observing protein-DNA contact-mediating solvent 
molecules, we created a series of MepR mutants, in 
which residues potentially capable of establishing water- 
mediated contacts with the bases of the major groove, 
including Thr63, Asn66, Arg69 and Asn70, were 
replaced by alanine. We tested the DNA-binding activity 
of these MepR mutants using P-galactosidase assay. None 
of these mutations affected the repressor function of 
MepR (Figure 5B, Supplementary Table S2), with the 
exception of T63A, which demonstrated an ~2-fold loss 
of repressor activity for the mepR operator in comparison 
to WT MepR, but no loss of activity at the mepA 
operator. In contrast, ITC results showed that the 
MepR(T63A) protein binds the mepR operator site with 
WT affinity (Supplementary Figure S2). The reasons 
behind the observed 2-fold loss of the in vivo repressor 
function of MepR(T63A) are not clear but might be due 
to a lower expression level or stability in the SA-K2916 
strain. 

Thus, the crystal structure of the MepR-DNA complex 
and in vivo P-galactosidase assays with the selected MepR 
recognition-helix mutants support the idea that the speci- 
ficity of MepR binding to cognate DNA is achieved solely 
via van der Waals interactions. Residues Thr60, Gly61, 
Pro62 and Thr63 appear to be the only structural 
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Figure 5. Interaction of the recognition helix (oc4) with the major groove. (A) Residues Thr60, Gly61, Pro62 and Thr63, shown as atom-coloured 
sticks, establish van der Waals contacts with the TTAG bases of the GTTAG signature sequence, shown as sticks and atom-coloured. The atoms 
involved in van der Waals interactions are shown as semi-transparent spheres with the carbon atoms of the DNA bases coloured beige and those of 
the amino acid residues coloured green. Key residues and bases are labelled. (B) Repression of the chromosomal mepR or mepA operator by MepR 
mutants T63A, N66A, R69A and N70A as determined by the ability of the mutant protein to affect the expression of mepRr.lacZ and mepA::lacZ 
transcriptional fusions. The data are presented as the mean repressor activity +/- the standard deviation. The data were normalized to MepR WT 
and are provided in Supplementary Table S2. (C) Sequence alignment of the recognition helices of selected MarR family proteins. Red circles indicate 
the residues interacting electrostatically with the major groove bases in their corresponding protein-DNA complexes and the blue boxes represent 
van der Waals contacts. The alignment was created using ClustalW (45) and shaded using BoxShade 3.2. 
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factors that discriminate in favour of the TTAG bases of 
the GTTAG signature sequence. To test further whether 
the observed van der Waals interactions indeed determine 
specificity of binding, we monitored WT MepR binding to 
DNA containing altered GTTAG half-site sequences. 
Therefore, we designed three mutated sequences, GTTT 
G, GATAG and GTAAG, which altered only one of 
the signature sequence bases and would eliminate 
their observed van der Waals interactions between the 



recognition helices and the major groove. These mutated 
sequences bound WT MepR with only marginal decreases, 
at most ~3-fold, in DNA-binding affinity (Figure 6). 
However, when all three mutations are combined to give 
the quintet, GAATG, the dissociation constant increases 
to 10 micromolar (Figure 6). Modelling of the substitution 
of the last guanine :cy to sine with an adeninelhymine pair 
(GTTAG to GTTAA) reveal the introduction of severe 
steric clash between the methyl group of the thymine 



Time (min) 
30 60 90 120 150 



Time (min) 
0 30 60 90 120 150 




0.50 

0.40 

0.30 

0.20 

0.10 

0.00 
8.00 

6.00 

4.00 

2.00 4 

0.00 



Time (min) 
0 30 60 90 120 150 



GTAAG 



K d = 77 nM 



Molar Ratio 



Molar Ratio 



Time (min) 

30 60 90 



120 



150 



Time (min) 
30 60 90 120 150 



0.80 

0.60 

8 0.40 
to 

^ 0.20 
0.00 
6.00 



t3 4.00- 



2.00 



0.00- 



n 1 1 1 1 1 1 1 r 



GAATG 



K d = 10 \xM 



2.50 
2.00 
1.50 

o 

ID 

« 1.00 

ro 

o 

^ 0.50 
0.00 
16.00 

c 

■g 12.00 

(D 

^ 8.00 



4.00 



0.00 



GTTAA 




1 2 3 
Molar Ratio 



12 3 4 
Molar Ratio 



Figure 6. MepR binding to mutated signature sequences of the mepR operator. Variants of the WT signature sequence GTTAG are shown with 
mutated nucleotides shown in bolded italics. The 'palindromized' 26 bp oligodeoxynucleotide 5 , -TATTTA GTTAG ATAT CTAAC TAAATA-3' was 
used as the template in which to make appropriate base pair changes in the signature sequence for binding studies. In all experiments, protein was 
injected into DNA. A binding stoichiometry of n = 2 corresponds to 1 dimer of MepR binding to one dsDNA. The fitted thermodynamic parameters 
are provided in Supplementary Table SI. In the first experiment, 200 uM MepR was injected into 10 uM dsDNA; in the second and third — 185 uM 
MepR was injected into 10 uM dsDNA; in the fourth — 452 uM MepR was injected into 20 uM dsDNA; and in the fifth — 412 uM MepR was injected 
into 20 uM dsDNA. 
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base and residue Thr63. As expected, this mutation com- 
pletely abrogates MepR binding. Thus, the recognition of 
specific DNA-binding sites by MepR is dictated by van 
der Waals interactions. Because this type of interaction is 
less discriminating and energetic than a typical hydrogen 
bond, more van der Waals contacts are necessary to 
achieve the required DNA specificity, which explains the 
small increases in K d for the single mutations of the TTA 
triplet of GTTAG motif and the necessity to introduce 
multiple changes to lower the binding affinity significantly. 
In accord, neither the single MepR mutation P62A, pre- 
viously reported (49), nor the single mutation T63A, 
described in this work, have affected the repressor 
function of MepR. In addition, MepR does not only 
discriminate in favour of a particular sequence, but also 
against some specific sequence alterations, as observed for 
the GTTAA and GAATG mutations (Figure 6). 

Distal protein-protein interactions stabilize the 
DNA-binding conformation of MepR 

A more detailed investigation of the DNA-bound form of 
MepR revealed two protein-protein interaction networks, 
which are absent from the apoMepR structure but we posit 
are important in stabilizing the DNA-binding conform- 
ation of MepR. One network involves the side chains of 
the residue ArglO' (where the prime indicates a residue 
from the other subunit of the MepR dimer), His35 and 
Phel08, which stack upon each other in a manner that 
is similar to DNA base stacking (Figure 7). In most of the 
independent copies of the MepR-DNA complex, the planes 
of the guanidine, imidazole and benzyl side chains are 
nearly parallel, and the distances between these planes 
are 3-4 A. The side chains of residues ArglO', His35 and 
Phel08 contain delocalized Tr-electrons and can participate 
in Ti-71 or cation-Tt interactions. In the apoMepR structure, 
residues ArglO', His35 and Phel08 do not interact and 
are located 7-10 A away from each other (Supplementary 
Figure S3B). To test whether this triplet interaction con- 
tributes to, or is absolutely necessary for specific DNA 
binding we created the single MepR mutants R10S and 
H35A and measured their DNA-binding affinities 
(Supplementary Figure S3A). Both MepR(RlOS) and 
MepR(H35A) displayed ~ 10-fold decreased affinities for 
the mepR operator (K d (R10S) = 300 nM and K A 
(H35A) = 420 nM). The second protein-protein inter- 
action network involves residues His 14' of al and Arg59 
found in the loop between helices a3 and a4 of the neigh- 
bouring subunit (Figure 7). The planes of the imidazole side 
chain of His 14' and the guanidinium group of Arg59 are not 
necessarily parallel to each other but all within 3^4 A, which 
again indicates 71-71 or cation-71 interactions similar to those 
in ArglO':His35:Phel08 triad. In apoMepR, residues 
His 14' and Arg59 do not interact: the closest His 14'- 
Arg59 approach is 13.5 A, (Supplementary Figure S3B). 
The effect of substituting His 14 with alanine on DNA- 
binding activity of MepR is identical to the R10A and 
H35A changes: an ~10-fold increase in the dissociation 
constant (K d = 380 nM) compared to WT MepR 
(Supplementary Figure S3 A). 




Figure 7. Intersubunit contacts buttress the DNA-binding conform- 
ation of MepR. Interacting residues of Hisl4':Arg59 and 
ArglO':His35:Phel08 are shown as sticks; the distances between the 
residues are indicated. The dissociation constants (K^) of MepR 
mutants H14A, R10S, H35A are provided. 



Beyond these interactions, residue Arg79, which is 
located in the middle of the (31 strand of the wing, also 
plays an important role in the DNA-binding activity of 
MepR. Residue Arg79 is involved in a network of electro- 
static interactions with residues Gln50 of <x3 and Glu72 of 
a4 (Supplementary Figure S3C), which helps to preserve 
the proper fold of the helix-turn-helix motif, as this 
network is present in both the apo and DNA-bound 
forms of MepR. The same Gln-Arg-Glu triad is found 
in the SlyA-DNA structure, whereas in the OhrR-DNA 
complex only the Arg-Glu interaction is observed. The 
MepR mutant in which residue Arg79 is substituted by 
alanine has very poor DNA-binding activity as evidenced 
by EMSA (Supplementary Figure S3D). 

The structure of a non-specific MepR-DNA complex 

In addition to the three independent specific MepR-DNA 
complexes that are seen in the asymmetric unit of the 
trigonal crystals, an additional MepR dimer is bound to 
DNA. However, unlike the specific complexes, this fourth 
dimer interacts with two different DNA molecules acting 
as a non-specific tether (Supplementary Figure S4A). The 
dimerization domain of the non-specifically bound MepR 
dimer is significantly disordered and due to poor electron 
density, residues 115-124 could not be built. Regardless, 
the overall configuration of the dimer is discernable and is 
distinct from the specific DNA-binding conformation of 
MepR. Indeed, the distance between the centres of the 
recognition helices is ~42 A, and the orientations of rec- 
ognition helices are substantially different from those of 
MepR in complex with DNA. As a result, the alignment of 
the dimer from the MepR-DNA complex with the non- 
specifically bound MepR dimer produces a striking 
RMSD of 6.5 A. Further, the conformation of the non- 
specific MepR dimer is also distinct from the apoMepR 
dimer reported by Kumaraswami et al. (21) (RMSD = 
4.9 A), which again demonstrates the great flexibility and 
adaptability of the MarR family members. In general 
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terms, almost all protein-DNA interactions observed in 
the non-specific complex are confined to the DNA 
backbone and involve many, but far fewer of the residues 
that also make DNA-backbone interactions in the specific 
complexes (compare Figure 3 with Supplementary 
Figure S4B). In contrast, the wing is still engaged in 
hydrogen bonds with minor groove bases but not those 
contacted in the specific complex (Figure 3, 
Supplementary Figure S4B). We propose that this 
MepR-DNA complex provides fortuitous insight into 
one of the non-specific DNA-binding modes of MepR 
and potentially other MarR family members. A more 
detailed description of the non-specific MepR-DNA 
complex is provided in the Supplementary Materials. 

Comparison of DNA-bound MepR with a DNA-free 
MepR 

Comparison of the structure of DNA-free MepR reported 
previously (21) to DNA-bound MepR demonstrates that 
significant conformational changes are required for 
specific DNA binding. Particularly, the wHTH domain 
of each MepR subunit undergoes ~45° rotation towards 
the ocl helix of the neighbouring subunit during complex 
formation (Supplementary Figure S5A). Such movements 
of the DNA-binding domains place the recognition helices 
~32 A apart (the distance from the Cot atoms of residues 
Leu68 and Leu68'), which is an ideal distance for the inter- 
action of the recognition helices with consecutive major 
grooves and the wings with the flanking minor grooves 
of B-DNA. Key to the observed conformational change 
is the distortion of the linker region such that a kink is 
created at position 103 of helix a5 and helix od is bent at 
position 16. Such kinking and bending of helices a5 an al 
in the specifically DNA-bound MepR corroborates our 
recent finding, based on a comparison of the apoMepR 
structure with the structure of a second ligand-free MepR 
dimer, which assumed a conformation more compatible 
with DNA binding and underscores the importance 
of flexibility of the MepR linker region in adopting a 
quaternary structure that is optimal for specific DNA 
binding (35). 

Comparison of the MepR-DNA complex to other 
MarR-DNA complexes 

The two crystal structures described herein provide six 
independent copies of the MepR-DNA complex. These 
align well within experimental errors with RMSDs 
between 0.3 and 0.6 A (here and below only the protein 
dimers of each complex were included in the alignment). 
The overall shape of the MepR-DNA complex is triangu- 
lar, which is similar to other structures of MarR family 
members bound to DNA (Supplementary Figure S5B, 
(11,13,30,32-34)), with the one exception, ST1710 (22). 
The structures of DNA-bound MepR dimers superimpose 
very well on the dimeric structures of DNA-bound OhrR 
(PDB accession code lz9c) (RMSD = 1.6 A) and DNA- 
bound SCO3205 (PDB accession code 3zpl) (RMSD = 
1.7 A) with the largest deviations in both alignments 
found in the dimerization domains. Also in SCO3205, 
the C-terminal tail is 12 residues longer than in MepR 



and extends into the DNA major groove, where its 
terminal residue, Argl63, interacts with residue Arg72 of 
the recognition helix, which in turn makes a sequence- 
specific contact to A 5 or a backbone contact with the 
phosphate group of G 4 (34). A significantly worse align- 
ment is observed with DNA-bound SlyA dimer (PDB ac- 
cession code 3q5f) (RMSD = 2.6 A), in great part because 
the DNA-binding domains are farther apart in the SlyA- 
DNA complex. However, when individual MepR subunits 
are aligned with subunits of SlyA from the SlyA-DNA 
complex, the RMSD diminishes to 1.0 A. Structural align- 
ment of the DNA-bound forms of dimeric MepR and 
MosR (PDB accession code 3gfi) also reveals a fairly 
large RMSD (2.3 A). Similar to OhrR, the largest discrep- 
ancy is concentrated in the dimerization domain, while the 
wHTH motifs are structurally conserved. Finally, the 
structure of ST1710-DNA complex (PDB accession code 
4fx4) stands out due to its unusual mode of interaction 
with DNA: the sequence-specific interactions are found 
only between the wing motifs and the minor grooves, 
whereas at the same time, the recognition helices provide 
only non-specific contacts to the sugar-phosphate 
backbone of the DNA and do not interact with the 
major grooves. The resulting superposition of a DNA- 
bound MepR dimer and & DNA-bound ST1710 dimer 
results in an RMSD = 4.7 A. Thus with the exception of 
ST1710, the MarR family members bind their cognate 
DNA sites using similar, but not identical conformations 
with their major structural differences confined primarily 
to the dimerization interfaces. 

A model of MepR bound to mepA operator 

The mepR operator contains a single inverted repeat and, 
thus, binds only one MepR dimer (Figure 3, (21)). In 
contrast, the mepA operator sequence encompasses two 
inverted repeats, which allows binding of two MepR 
dimers (Figure 8, (21)). Inspection of the mepA operator 
sequence reveals that the individual MepR binding sites 
are essentially adjoining as only a single T-A base pair 
separates the 3' end of the 1st site from the 5' end of the 
2nd site (Figure 8). Such close positioning of the binding 
sites raises the possibility that MepR dimers interact co- 
operatively to bind to the mepA operator. To investigate 
this possibility, we used the structure of MepR-mepR 
operator site to model the MepR-mepA complex 
(Figure 8). The model demonstrates that the MepR 
dimers are positioned on nearly opposite faces of the 
DNA: the 2-fold axes of the two dimers create a 
dihedral angle of ~130° with the DNA axis. 
Interestingly, no interactions between MepR dimers 
could be inferred from the model, which at first glance 
would imply that MepR binds the mepA operator sites 
independently, in a non-cooperative fashion. 

To investigate the thermodynamics of MepR binding to 
the mepA operator and to ascertain whether or not such 
binding is cooperative, we carried out a series of DNA- 
binding experiments. MepR binding to the WT mepA 
operator is characterized by a single sharp transition 
(Figure 9). The stoichiometry of binding (n = 4) indicates 
that two dimers of MepR interact with one DNA duplex, 
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mepA 5'-TATAGTTAGATATCTAATTGTTAGTAAGCTAATTATTGGA-3' 
operator 3'-ATATCAATCTATAGATTAA CAATCATTCGATTA ATAACC-5' 

Inverted repeat 2 

Figure 8. A structure-guided model of MepR bound to the mepA operator. The sequence of the mepA operator is shown with key promoter elements 
highlighted (the -35 and -10 boxes are shown in red font and the TSS in blue). Two MepR binding sites (Inverted repeats 1 and 2 corresponding to 
the 1st and the 2nd sites, respectively) are indicated by arrows. 
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Figure 9. MepR binding to the mepA operator. In all experiments, MepR was injected into dsDNA. The DNA concentration was 10 uM, the 
concentration of MepR in the first experiment was 272 uM, in the second and fourth, 200 uM, and in the third, 370 uM. A stoichiometry of 
4 corresponds to 2 dimers of MepR binding to a single dsDNA molecule; a stoichiometry of 2 corresponds to 1 dimer of MepR binding to a 
single dsDNA molecule. In MepR binding to the 46 bp mepA operator in which the 2nd site is scrambled, only the first transition was fit. The 
sequences of the oligodeoxynucleotides and fitted thermodynamic parameters are provided in Supplementary Table SI. 



and the fitted K d (42 nM) is very close to the K d of MepR 
binding to the mepR operator (36 nM). The thermogram is 
more complex for MepR binding to the 46 bp mepA 
operator in which the second binding site was scrambled 
(Figure 9). Specifically, a second less steep transition 
appears. We assigned the first transition, which displays 
a stoichiometry of 2 (1 MepR dimer per 1 DNA duplex) to 
MepR binding to the intact 1st mepA operator site. The 
second transition likely represents a binding mode 
observed for the non-specific MepR-DNA complex 
(Supplementary Figure S4). Interestingly, when the 1st 
site of the mepA operator is destroyed, MepR binding to 
the intact 2nd mepA site is dramatically impaired 



(,K d = 3.8uM, n = 2.32, which implies the binding of 
1 MepR dimer per one DNA duplex; Figure 9). These 
data corroborate previous findings of Kumaraswami 
et al. (21), which demonstrated that 5' end truncations 
of the mepA operator affect MepR binding much more 
significantly than 3' end truncations, and proposed that 
the 5' end (the 1st) mepA site is the primary MepR 
binding site, while the 3' end (the 2nd) mepA site is a 
lower affinity, secondary site. Intriguingly, MepR binds 
to a 26 bp oligodeoxynucleotide encompassing only the 
second mepA site with high affinity (K d = 40 nM). Thus, 
our data show that MepR binding to an individual mepA 
2nd site is indistinguishable from binding to the mepR 
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operator = 36 nM, Figure 1) or full mepA operator 
(K d = 42 nM, Figure 9). In the context of the mepA 
operator, however, the binding of MepR to the 1st site 
is absolutely necessary for high affinity MepR binding to 
the 2nd site, which implies positive cooperativity in the 
overall MepR binding to the mepA operator. In accord, 
calculation of the Hill coefficient using the equation 
« H = 2 / [1+(K'/K) 1/2 ] (here K' is the dissociation 
constant for the intact mepA operator, and K is the dis- 
sociation constant for the mepA operator, in which the 1 st 
site was destroyed), previously employed by Ramos et al. 
(50) to study the cooperativity of binding of the TtgR 
repressor to its operator site, results in a value of 1.81 
supporting our hypothesis that binding shows positive 
cooperativity. It is likely, that this cooperativity is 
achieved via DNA-mediated allosteric effects (51), since 
the model of the MepR-mepA operator complex reveals 
no obvious direct interactions between the two MepR 
dimers. In order to test this hypothesis and provide a 
more detailed mechanism of MepR binding to the mepA 
operator, the structure of the MepR-mepA complex along 
with additional biochemical studies are necessary and 
beyond the scope of the current work. 

MarR proteins as global and local transcription regulators 

Several MarR proteins act as global regulators thereby 
affecting the expression of multiple genes. These include 
SarA from Staphylococcus aureus (52) and Enterococcus 
faecalis SlyA (15). On the other hand, some MarR family 
members regulate only one operon and are often called 
'local' regulators. These include B. subtilis OhrR (53), 
E. coli MarR (5) and Pseudomonas aeruginosa MexR 
(54). Recently, Dolan et al. (33) reported the structure of 
S. enterica global regulator SlyA bound to its cognate 
DNA and hypothesized that SlyA uses 'indirect readout' 
to identify its binding sites in numerous operator sites. The 
authors linked the small number of direct electrostatic 
base-amino acid side chain interactions between the recog- 
nition helix and the major groove (in the SlyA-DNA 
complex there is only one bidentate hydrogen bond 
between Arg65 and Gi 4 ) with the ability of the repressor 
to recognize multiple operator sites that are variable in 
length and sequence, and suggested that the lack of 
direct, specific protein-DNA interactions is a hallmark 
feature of global virulence regulators. Our structure of 
MepR in complex with its cognate DNA is inconsistent 
with this assumption, as there are no hydrogen bonds 
between the amino acid side chains of the recognition 
helix and the bases of the GTTAG signature sequence 
motif (Figures 3 and 5). Rather, the specificity of MepR 
binding is determined solely by the combination of van der 
Waals contacts, which are typically not powerful discrim- 
inators of DNA bases. In accord, a transcriptome analysis 
of a mepR deletion strain suggests that MepR does not 
mediate any regulatory effects outside of the mepRAB 
operon (G.W. Kaatz, unpublished data). Therefore, it is 
likely that the number of the specific protein-DNA major 
groove hydrogen bonds is not the only criterion to distin- 
guish between local and global regulators in the MarR 
family. It will be of great interest to discern those factors 



that would allow a robust prediction of the global or local 
gene regulatory functions of MarR family members. 

The organization of the mepR and mepA operators 

The structure of MepR in complex with mepR operator 
DNA and our DNA-binding data indicate that the GTTA 
G signature sequence is relatively insensitive to single sub- 
stitution mutations. It predicts that changing the last 
guanine to thymine or cytosine, or the third thymine to 
cytosine, producing the sequences GTTA J, GTTAC and 
GTCAG, will not introduce any unfavourable or remove 
any favourable protein-DNA contacts, and, theoretically, 
will not impair MepR binding. We have shown that even 
those mutations in the GTTAG motif, which remove one 
or two van der Waals contacts, do not affect the affinity of 
binding significantly (only a 2-3-fold increase in K d per 
change, Figure 6). Intriguingly, neither base of the first G- 
C base pair of the signature motif is contacted by the 
protein, which is concordant with the lack of conservation 
of this base pair in the two half-sites of the mepA operator 
(Figure 8). Nevertheless, the GTTAG signature sequence 
is very well preserved in the mepR and mepA operators 
((19), Figures 3 and 8). The reason for this preservation 
likely arises from the organization of the mepR and mepA 
upstream promoter regions, which have to provide not 
only high affinity binding sites for MepR, but also 
maintain optimal -35 and -10 promoter elements for 
RNA polymerase binding. Thus, in the mepR operator, 
the last guanine nucleotide in the first half-site (GTTAG, 
Figure 3) is also part of the TSS, while in the mepA 
operator, the -35 box, TAGTTA, overlaps significantly 
with the first half-site of the first mepA operator site 
(GTTAG, Figure 8). Thus, the signature sequence is main- 
tained in order to serve as binding sites for two proteins. 

Conclusions and perspectives 

In this work we provide the molecular details of the tran- 
scription repression mechanism of the multidrug efflux 
operon mepRA by the MarR regulator MepR. The struc- 
ture of MepR in complex with its mepR operator DNA 
reveals that although MepR binds DNA similarly to other 
MarR proteins, this repressor uses a unique mechanism of 
binding site recognition based solely on van der Waals 
interactions between the amino acid residues of the recog- 
nition helix and the bases of the major groove encompass- 
ing the GTTAG signature sequence. Using the MepR- 
mepR operator structure we modelled a MepR-me/7^4 
operator complex, which contains two MepR dimer- 
binding sites. This model suggests that the MepR dimers 
do not interact with each other in MepR-w7e;;^4 operator 
complex, and the observed cooperative binding of MepR 
to the second mepA operator site is likely caused by DNA- 
mediated allosteric effects (51). 

We also identified two protein-protein interaction 
networks that contribute to or stabilize high affinity 
DNA binding by MepR, which include the Hisl4':Arg59 
pair and ArglO':His35:Phel08 triad. In both networks, the 
residues interact via cation-7i or n-n interactions. In most 
crystal structures of MarR proteins bound to cognate ef- 
fectors (PcaV-protocatechuate (PDB accession code 4fht) 
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(30), ST1710-salicylate (PDB accession code 3gez) (22), 
MTH313-salicylate (PDB accession code 3bpx) (24), 
S. aureus MarR-kanamycin (PDB accession code 4emo), 
the ligand is located in a cleft surrounded by helices al, 
a2 and a5. By inference, we propose that residues ArglO 
(al), His35 (a2) and Phel08 (<x5) are involved in MepR- 
ligand interaction. The substrates of MepR are cationic 
lipophilic compounds and frequently possess delocalized 
7i-electron systems, which would allow them to interact 
with these residues and, hence, disrupt this triad. As a con- 
sequence of the disruption of this interaction network, the 
MepR-DNA affinity would be decreased (Supplementary 
Figure S3A) and lead to the induction of MepR. In accord, 
residue Argl5 of 5. coelicolor PcaV, which corresponds to 
the MepR residue ArglO, is critical for both ligand and 
DNA binding (30). Structures of MepR-ligand complexes 
will be necessary to confirm the importance of ArglO and 
other residues of the triad and the Hisl4':Arg59 pair in 
drug binding and induction. 
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